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Abstract. Using rail-to-rail (R-R) swing analog circuits has become almost mandatory in the design of low supply 
voltage circuits. In this paper, a new architecture for constant-g m rail-to-rail input stages is presented. The design 
features a less than 5% deviation in g m over the entire range of the input common-mode voltage. Furthermore, a 
new structure for folded cascode amplifier based on the use of a floating current source is presented. By employing 
these techniques, a low -power operational amplifier (op-amp) with 100 MHz unity-gain bandwidth, 106 dB gain, 
60° phase margin, 2.65 V swing, and 6.4 nV/^/Hz input-referred noise with rail-to-rail input common-mode range 
is realized in a 0.8 /xm CMOS technology. This amplifier dissipates 10 mW from a 3 V power supply. 
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1. Introduction 

Operational amplifiers are among the most widely used 
functional blocks for high-level analog and mixed- 
signal integrated circuits. The overall performance of 
many circuits and systems depends mainly on the qual- 
ity of the op-amp used in their design. High-speed op- 
amps with R-R input common-mode range have a wide 
range of applications in high-speed and low-voltage 
signal processing circuits. 

At large supply voltages, there is a tradeoff among 
speed, gain and power. Signal swing becomes yet an- 
other performance metric to be considered when de- 
signing op-amps at low supply voltages [1]. For high- 
swing op-amps operating at low supply voltages, out- 
put swing is not usually a problem because class-A or 
class-AB output stages or even folded cascode archi- 
tectures can drive loads close to the power supply rails. 
However, designing high-swing input stages remains a 
challenging task. 

In order to have a R-R input common-mode range, 
a traditional approach is to use two complementary 
differential pairs to form the input stage (Fig. 1) [2]. 
The basic idea is to have at least one of the input differ- 
ential pairs active over the entire input common mode 
voltage, Vcm- However, this circuit suffers from some 
drawbacks. The total input transconductance, g mT , 



is given by the sum of the transconductances of the 
NMOS and PMOS differential pairs. When Vcm is at 
the midway of supply voltages, both NMOS and PMOS 
differential pairs are active. As a result, g m j could 
be much larger compared to when Vcm is near one of 
supply voltages. Therefore, the deviations in g mT as a 
function of Vcm can be as much as 100% (Fig. 2). Fur- 
thermore, at large signal regime, the large signal output 
current is much larger when Vcm is halfway between 
the supply voltages. This means that the slew-rate 
of a conventional single-stage or two-stage amplifier 
with this input stage will be a function of Vcm- 
These undesirable effects complicate the frequency 
compensation and result in harmonic distortion [3]. 

Limitations of the circuit depicted in Fig. 1 pro- 
vided an incentive for analog designers to create input 
stage circuits that have a constant input transconduc- 
tance over the whole range of Vcm [3-10]. As will 
be explained in Section 3, using these input stages in 
single-stage op-amps results in poor tradeoff between 
power consumption and speed of the op-amp. Indeed, 
almost all of the R-R constant-^,,, operational ampli- 
fiers have been realized in two- or three- stage config- 
urations, which result in limited bandwidth and speed 
for the amplifier. 

In this paper, a new architecture for constant-g m 
R-R input stages is presented that has less than 5% 
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Fig. 1 . Typical rail-to-rail CMOS input stage. 
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Fig. 2. g mT versus V C M- 



deviation in g m over the entire range of Vcm- Further- 
more, a new structure for folded cascode amplifier is 
presented that allows designing a power-efficient R- 
R input single-stage op-amp [11]. This paper is orga- 
nized as follow: input stage and its operation are ex- 
plained in Section 2. In Section 3, the limitations of 
using R-R input stages in single-stage op-amps is de- 
scribed and subsequently a new architecture for folded 
cascode amplifier based on the use of a floating current 
source which mitigates those limitations is presented. 
In Section 4, results are presented to illustrate the ef- 
fectiveness of presented approaches. Section 5 is the 
conclusion. 



2. Rail-to-Rail Constant g m Input Stage 

Based on the MOS square law equation (7 = 
K(W/L)(V GS - V,) 2 ), if the W/L ratio of a MOS tran- 
sistor is kept constant and its current is increased by a 
factor of 4, its g m will be doubled. This fact is the prin- 
ciple for most of the constant-g m input stages presented 
so far [3-8]. The principal drawback of this approach 
is revealed in CMOS submicron processes in which the 
square law equation in a MOS transistor is no longer 
valid and g m is less dependent on the transistor cur- 
rent. That is, the current increment factor should be 
higher than 4, i.e. 6 or 7, to double the g m of an MOS 
transistor. As a result, when Vcm is near power supply 
rails, power consumption increases considerably. This 
makes the design of the current summation (folded cas- 
code) circuit complicated. 

There are some other structures [9, 10], which do 
not operate based on the MOS square law equation. 
However, their input transconductance are subject to 
large variations. The operation of the novel architec- 
ture presented here is not based on the MOS square 
law equation. Besides, it maintains a nearly constant 
g m in the whole range of Vcm and shows superior per- 
formance compared to other structures. 

The idea of this technique is to activate another pair 
similar to the active pair when the opposite pair has lost 
required gate bias for proper operation. As a result, at 
the extreme input voltage ranges where one input pair 
has lost the required gate bias for proper operation, two 
similar pairs of other polarity generate signal current 
in parallel. Consequently, the input transconductance 
is doubled. To implement this approach, as shown in 
Fig. 3, three differential pairs of each polarity are used. 
All the PMOS differential pairs have equal W/L ra- 
tios. All the NMOS differential pair transistors are also 
identical. 

When Vcm is at the midrange of supply voltages, 
amongst the NMOS transistors, just pairs MN1-MN2 
and MP1-MP2 generate signal current. In this region of 
operation, the current from the current source 7^3 flows 
through transistors MN5 and MN6 and sinks all the tail 
current of the pair MP3-MP4, resulting in the pair be- 
ing off. On the other side, the current of the source 
7p3 flows through transistors MP5 and MP6 and sup- 
plies all the tail current of the pair MN3-MP4, resulting 
in MN3-MP4 switching off. Hence, only four devices 
MN1, MN2, MP1, and MP2 generate signal current 
and g mT is equal to the sum of input NMOS transistor 
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Fig. 3. New constant-g,„ rail to rail input stage. 



transconductance, g mn , and input PMOS transistor 
transconductance, g mp . 

Assuming that the nominal values of the g m of 
NMOS and PMOS pairs are equal, g,„„ — g mp — g,„, 
then g m r is equal to 2g m , Besides, there is a total cur- 
rent of 27 available for the current summer when the 
circuit is in the slew-limited mode. 

When Vcm is near Vdd, none of the PMOS transistors 
have a sufficient Vqs to remain active. Since there are 
no other paths for tail current sources Ip\ and Ipi, 
these current sources are deactivated. Therefore, the 
tail current source 1^2 is not supplied from elsewhere 
and flows through the pair MN3-MN4. Therefore, this 
pair turns on and works in parallel with the main pair 
MN1-MN2. As a result, g m j is equal to 2g mn which is 
equal to 2g m . There is also a total current of 2 / available 
in the limiting situation. 

At the other extreme point of Vcm, similarly, 7jv3 is 
pushed to the cutoff state and Ip2 flows through MP3 
and MP4. Consequently, the equivalent transconduc- 
tance is 2g mp — 2g m , There is also a total current of 21 
available in the slew-limited regime. 

If due to transistor mismatches, 7^2 and Ip2 exceed 
7 slightly, or 7#3 and Ipj become slightly smaller than 
/, the pairs MN3-MN4 and MP3-MP4 will turn on 
when Vcm is in the midway of supply voltages. This 



g mT before improvement 
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Fig. 4. The principle of reducing the g m t deviations in new R- 
R input stage. The solid and the dotted lines are the g m r and its 
components before and after the improvement, respectively. 



undesired activation of pairs MN3-MN4 and MP3- 
MP4 increases g m j and enhances the g m j deviations. 
By setting the value of current sources 7p3 and 7^3 
slightly higher than 7, i.e. 7 + A7, this problem can be 
eliminated. 

In this circuit, by handling the W/L ratio and Vgs 
of the transistors realizing the tail current sources, the 
g m r deviation can be further reduced. The concept of 
decreasing the g m j deviation is shown in Fig. 4. The 
solid and the dotted lines are the g mT and its compo- 
nents before and after the improvement, respectively. 
The overhangs in the g m j curve are because of the large 
slope in the g m of differential pairs in takeover regions. 
As illustrated in Fig. 4, as the slopes of the curves are 
reduced, the g mT curve becomes smoother. To reduce 
this slope, i.e. the slope of the g m - Vcm curve of a dif- 
ferential pair in takeover regions, the effective voltage, 
Vbs.sat, of the tail current source transistor should be in- 
creased. That is, the effective voltages of the transistors 
which realize the current sources I N1 and I P1 should be 
increased. This can be done simply by reducing their 
W/L ratios and increasing their Vgs- 

By cascoding the tail current sources 7^2, In3, 
Ip2, and 7^3, the deviations in g m j are further re- 
duced. The post-layout simulation result of the g m of 
the input stage is shown in Fig. 5. With this topol- 
ogy, the g m j deviations could be less than 5%, how- 
ever, since six differential pairs have been used to re- 
alize the R-R input stage, the input capacitance of 
this op-amp is somewhat considerable, which may 
not be acceptable in some applications. On the other 
hand, this large input capacitance results in smaller 
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input referred noise, which is desirable in many 
applications. 



3. Adaptive Biased Current Summation Circuit 

Another important block in an op-amp with R-R input 
common-mode range is the current summing circuit. 
The conventional approach for realizing this circuit is 
shown in Fig. 6. However, when this circuit is used 
in single-stage amplifiers, some problems are caused 
in the frequency compensation of the amplifier. These 
problems can be addressed as follows with reference 
to Fig. 6: 
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The current of NMOS input devices, I N , changes 
between zero for Vcm = Vss an d more than 2I„ for 
Vcm = Vdd; where /„ is the NMOS input stage bias 
current for when Vcm is at midrange of supply voltages. 
As a result, the bias current of Ml and Ml, Ib, not 
only should be able to supply the bias current Ip of 
transistors M3 to M6,but also should be high enough 
to handle the large changes in the current of NMOS 
input devices (from zero to 2I„). 

Note that when Vcm is at the midway of supply volt- 
ages or close to Vss, In is reduced, resulting in a consid- 
erable increases in I F . As a result, the transconductance 
and output resistance of transistors M3 to M8 change 
considerably, leading to large variations in pole-zero 
locations and also the open-loop gain of the op-amp. 

These large variations in pole-zero locations compli- 
cate the frequency compensation and prevent the op- 
timum usage of power in order to enhance the band- 
width. Here we can have a glance at the pole-zero loca- 
tions and gain of this amplifier. The poles and zeros of 
the op-amp depicted in Fig. 6 are located at following 
frequencies: 
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Fig. 6. Conventional method for biasing current summation circuit. 



where Cx, Cy, Cm and Cout denote the total capac- 
itances at nodes X, Y, M and OUT respectively. g m( , 
g mn and g mp denote the transconductances of transistor 
Mi, the NMOS input stage and the PMOS input stage, 
respectively, and ^? out is the output resistance of the 
op-amp obtained from: 

#OUT = (gm\r \(r 01 || r 0N )) II (gmtf '06<> '08 II '"op)) 

where r oi is the output resistance of transistor Mi. As 
it is obvious in the above equations, any variations in 
the bias current of the transistors considerably change 
the frequency response and output resistance as well as 
the open-loop gain of the op-amp. 

These variations in the bias current of the cur- 
rent summation transistors are not very important in 
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two- or multi-stage amplifiers, which use R-R input 
stage. This is mainly due to the fact that in two- or 
multi-stage amplifiers the voltage swing at the output 
node of the first stage is not a restrictive factor. There- 
fore, the W/L ratio of the output transistors of the first 
stage should not be high. This results in small parasitic 
capacitances at nodes X, Y, and M. Consequently, the 
poles located at these nodes are at fairly high frequen- 
cies and their effects can be safely ignored. Indeed, in 
these amplifiers the bandwidth is restricted to the poles 
of the second or other stages, which inherently are at 
lower frequencies compare to the poles located at nodes 
X, Y and M. 

However, for single-stage amplifiers, the output node 
of the current summation circuit is the output node of 
the op-amp. As a result, the voltage swing at this node 
determines the op-amp output voltage swing. Hence, 
the W/L ratio of the devices Ml to M8 should be 
high, resulting in high parasitic capacitances at nodes X 
and Y. Therefore, the small-signal parameters of these 
devices affect the frequency response of the amplifier 
considerably. In order to optimize the power consump- 
tion, the variations in the pole-zero locations due to the 
bias current variations should be minimized as much 
as possible. 

From detailed analysis, it becomes clear that the lo- 
cation of the poles, P 2 , P3, ^4,5 and zeros, Z\, Z23 is 
directly related to the transconductance of transistors 
M3-M6. In fact, the transconductance and output re- 
sistance of these transistors have the most important 
role in the pole-zero locations and gain of the op-amp. 
Therefore, stabilizing the quiescent current of these 
transistors helps in optimizing the frequency compen- 
sation and reducing the harmonic distortion of the 
amplifier. 

For this reason, both NMOS and PMOS cascoded 
transistors are used as current mirrors. Note that the 
current value of the current mirror changes adaptively 
by the changes in the input differential pair currents. 
The remained problem is to bias the current summation 
circuit. 

The first approach is to use two independent current 
sources as depicted in Fig. 7. The principal drawback 
of this approach is that the noise of bias current sources 
of the current mirrors directly contributes to the noise 
of the amplifier. This is due to the unity current gain 
between bias current sources and the drain currents of 
the input transistors. Any mismatch in the bias current 
sources will also increase the offset of the amplifier 
[6]. 




20 JH p 



hK 



Fig. 7. A simple approach for biasing current summation circuit. 



Using a floating current source between the drains of 
transistors M3 and M5, as shown in Fig. 8, can allevi- 
ate the mentioned problem. Besides, the bias current of 
transistors M3-M6 stays nearly constant. The circuit 
realization of the floating current source with the com- 
plete diagram of the entire amplifier is shown in Fig. 9. 
The value of this floating current source is determined 
by the MOS translinear loops Ml, M9, M12, Mil and 
Ml, M10, M13, M14. The floating current source has 
been used for the design of class AB amplifiers previ- 
ously [6], however, in this paper, it has been employed 
to bias the current summation circuit. 




—v n 



Fig. 8. Using a floating current source for biasing the current sum- 
mation circuit. 
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fig. 9. Complete schematic of the amplifier: the floating current source is realized by transistors M9 to M18. There are two gain-boosting 
amps, X and Y. The circuit realizations of the amplifiers X and Y are shown in right hand of the figure. 



Gain boosting has also been employed to ensure suf- 
ficient gain for the amplifier. Indeed, by two auxiliary 
folded cascode amplifiers, the output resistance of the 
amplifier increases considerably without any degrada- 
tion of frequency response [12]. The main parameter 
in designing the auxiliary amplifiers is their unity-gain 
bandwidth which should be about half of the frequency 
of the folding poles. That is, the unity-gain bandwidth 
of the auxiliary amplifiers X and Y should be chosen 
about half of the frequency of the poles Pi and P3, 
respectively [13]. 

4. Amplifier Specifications 

The proposed single-stage amplifier with constant- 
g,„ R-R input stage has been fabricated in a 0.8 /im 
double-poly, double-metal CMOS process provided by 
AMS Corporation. The core of the op-amp is shown 
in Fig. 10. This amplifier was used as a block in the 
design of a novel high frequency sampled-data filter, 
which operates based on neural network algorithm. The 
occupied die area is about 0.1 mm 2 . 

The experimental results show that this amplifier 
consumes a total power of 10 mW from a 3-V supply, 



while its input offset voltage is about 10 mV. The mea- 
sured 3 dB bandwidth in the non-inverting unity-gain 
feedback configuration is about 100 MHz. Input volt- 
age swing is from negative rail to positive rail, while 
the output swing is 2.65 V. 

The results of post-layout simulations obtained from 
rigorous HSPICE and Cadence simulators revealed that 
the g,„of the input stage, as shown in Fig. 4, has a de- 
viation of less than 5% over the entire range of Vcm- 



7 



77 






Fig. 10. Chip micrograph of the amplifier. 
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Table 1 . Op-amp characteristics with a 3- V Supply 
voltage and 5pF capacitive load. 



Parameter 



Value 



Die area 


0.1 mm 2 


Power Consumption 


lOmW 


Input voltage swing 


R-R 


Output voltage swing 


2.65V 


3 dB Bandwidth in noninverting 


100 MHz 


unity gain feedback 




Input offset voltage 


10 mV 


Gain* 


102 dB 


Phase margin* 


60° 


Settling time 0.1* 


15 nS 


Settling time 0.01* 


30 nS 


Slew rate* 


150 Y//J.S 


CMRR @ 1 KHz* 


121 dB 


Input noise voltage* 


6.4 nV/^Hz 


g m variation* 


5% 



*Simulated results 
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The variations of the unity-gain bandwidth and phase 
margin versus Vcm are shown in Figs. 1 1 and 12, re- 
spectively. These figures reveal that frequency response 
is robust enough in terms of the changes in Vcm- Fig- 
ure 13 shows the simulated frequency response of the 
op-amp with a 5 pF capacitive load when Vcm = 1 -5 V. 
It shows that the unity-gain frequency is about 1 00 MHz 
(which is in agreement with measured results). 

Figure 14 shows the step response of the amplifier 
in unity gain feedback configuration. The 0.1% and 
0.0 1 % settling times of the amplifier are 15 ns and 30 ns, 
respectively. Besides, the slew-rate of the amplifier is 
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1.5 pF. 
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Fig. 14. Small signal step response of the op-amp. 
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150 V//LtS, and the input referred noise is 6.4nVA v /Hz. 
The amplifier characteristics are summarized in 
Table 1. 

5. Conclusion 

In this paper, a new architecture for R-R constant- 
g m input stages is presented. The architecture is not 
based on the MOS square law equation, and has supe- 
rior performance compared to other R-R architectures. 
Additionally, based on the use of a floating current 
source, a new architecture is presented for designing 
single-stage operational transconductance amplifiers. 
The floating current source biases the output transistors 
of the amplifier with minimum changes in the pole-zero 
locus. Employing these new architectures, a low power, 
high swing, high speed, and high gain single-stage op- 
erational transconductance amplifier is designed and 
implemented in a 0.8 fim double poly double metal 
CMOS process which consumes less than lOmW from 
a 3-V supply. 
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